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ABSTRACT 



Each small instruction in a VLIW instruction (long 
instruction) is added with the number of NOP instructions 
which succeed the small instruction, and these NOP instruc- 
tions are deleted from the succeeding long instruction. A 
plurality of long instructions are therefore time-compressed. 
Thereafter, a plurality of small instructions in each long 
instruction are divided into a plurality of groups, and a 
combination of operation codes (OP codes) of small instruc- 
tions in each group is replaced by a group code to generate 
a compressed, grouped instruction. Each long instruction is 
therefore space-compressed. An instruction expanding unit 
has an instruction expanding circuit for each grouped 
instruction. Each instruction expanding circuit expands one 
grouped instruction in a long instruction, generates a group 
of small instructions represented by the grouped instruction, 
and supplies the group of generated small instructions to 
respective function units via a decode unit. In this case, each 
instruction expanding circuit supplies after each small 
instruction NOP instructions same in number as that desig- 
nated by a NOP number associated with each small instruc- 
tion in this grouped instruction. 

18 Claims, 15 Drawing Sheets 
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FIG.13B 
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FIG.13C 
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PROCESSOR FOR VLIW INSTRUCTION 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

The present invention relates to a processor for expanding 
and executing VLIW (very long instruction word) instruc- 
tions (hereinafter also called a long instruction). 

2. Description of the Related Art 

The performance of a computer is determined by a period 
of one machine cycle and CPI (cycles per instruction) 
indicating the number of machine cycles required for 
executing one instruction. It is essential for improving the 
computer performance to shorten both the machine cycle 
period and CPI. There is a scheme of shortening CPI in 
which a number of instructions are executed during one 
machine cycle in parallel at the same time. One of typical 
examples of this scheme is a VLIW scheme (refer to David 
A Patterson and John L, Hennessy, "Computer Architecture 
A quantitative approach" by Morgan Kaufmann Publishers, 
Inc., 1990). 

In the VLIW scheme, a long instruction containing a 
plurality of instruction fields is used, and each instruction 
field controls a function unit such as a calculation unit and 
a memory unit. One instruction can therefore control a 
plurality of function units. In order to simplify an instruction 
issuing circuit, each instruction field of a VLIW instruction 
is assigned a particular operation or instruction (hereinafter 
called a small operation or instruction). A plurality of small 
instructions of one VLIW instruction can control at the same 
time a plurality type of function units assigned to the small 
instructions. Each small instruction is constituted of an 
operation code (hereinafter called an OP code) representa- 
tive of the type of arithmetic operations and an operand 
representative of the subject of arithmetic operations. With 
the VLIW scheme, in compiling a VLIW instruction, the 
dependency relationship between small instructions of a 
program is taken into consideration to schedule the execu- 
tion order of small instructions and distribute them into a 
plurality of VLIW instructions so as to make each VLIW 
instruction contain concurrently executable small programs 
as many as possible. As a result, a number of small instruc- 
tions in each VLIW instruction can be executed in parallel 
and a computer executing such instructions does not require 
a complicated instruction issuing circuit. It is therefore easy 
to shorten the machine cycle period, to increase the number 
of instructions issued at the same time (hereinafter called an 
instruction parallel degree), and to reduce the number of 
cycles per instruction (CPI). This technique has drawn 
attention as a means of improving the performance of a 
computer. 

In the VLIW scheme, each VLIW instruction contains 
instruction fields corresponding to function units. If there is 
a function unit not used by a VLIW instruction, the instruc- 
tion field corresponding to this function unit is assigned a 
NOP (no operation) instruction indicating no operation. 
Depending on the kind of a program, a number of NOP 
instructions are embedded in a number of VLIW instruc- 
tions. As NOP instructions are embedded in a number of 
instruction fields of VLIW instructions, the number of 
VLIW instructions constituting the program increases. 
Therefore, the main storage and instruction cache are con- 
sumed in storing a large capacity of these VLIW instruc- 
tions. 

Several proposals have been presented in order to reduce 
NOP instructions. For example, in the study reports of 
Information Processing Society of Japan, Vol. 93-ARC-102, 
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pp. 17-24 (hereinafter called the first reference document), 
one or a plurality of consecutive invalid VLIW instructions 
each having only NOP instruction are removed. To this end, 
a field for storing the number of delay cycles of one or a 
plurality of invalid VLIW instructions is stored in the valid 
VLIW instruction to be executed immediately before one or 
a plurality of invalid VLIW instructions. After the preceding 
valid VLIW instruction, the succeeding valid VLIW instruc- 
tion is executed after the lapse of the delay cycles. This 
technique can reduce the number of VLIW instructions so 
that it can be considered as a method of compressing VLIW 
instructions in time. This conventional technique also pro- 
poses a method of improving a use efficiency of function 
units by a multi-thread process which switches a series of 
invalid VOW instructions having all fields filled with NOP 
instructions to another instruction series. 

Another method of compressing VLIW instructions in 
time has been proposed in the study reports of Information 
Processing Society of Japan, Vol. 94-ARC-107, pp. 113-120 
(hereinafter called the second reference document) or in the 
papers of "Parallel Processing Symposium JSPP J 92", pp. 
265-272 (hereinafter called the third reference document). 
In this technique disclosed, if a small instruction in each 
VLIW instruction is a NOP instruction, this NOP instruction 
itself is deleted. More specifically, each valid small instruc- 
tion of a VLIW instruction is provided with a field for 
storing the number of NOP instructions (hereinafter also 
called a NOP number). The number of NOP instructions to 
be executed by the function unit assigned to a valid small 
instruction is stored in this field, and one or a plurality of 
NOP instructions contained in one or a plurality of consecu- 
tive VLIW instructions before the valid small instruction are 
deleted. Namely, after the valid small instructions contained 
in preceding VLIW instructions are executed by the function 
unit, the execution of the next valid small instruction is 
delayed by the number of cycles determined by the NOP 
number. With this method, preceding NOP instructions can 
be deleted for each instruction field. Therefore, the total 
numbers of NOP instructions and VLIW instructions can be 
reduced more than the first reference document. 
Furthermore, since only the number of deleted NOP instruc- 
tions is stored in place of the NOP instructions themselves, 
the length of each VLIW instruction increases not so much. 
Therefore, the capacity of a program constituted such VLIW 
instructions can be reduced considerably as compared to the 
system not adopting this technique. 

According to the technique disclosed in JP-A-7-105003 
(hereinafter called the fourth reference document), a series 
of VLIW instructions containing other instructions in addi- 
tion to NOP instructions is compressed and stored in a main 
storage or the like. The series of stored VLIW instruction is 
expanded and executed. Specifically, each of VLIW instruc- 
tions in a program having different structures is replaced by 
a code sequence with a variable length, and each of VLIW 
instructions having the same structure is replaced by a 
predetermined code sequence. This compressed program 
made of a plurality of such code sequence is stored in a main 
storage. A plurality of non-compressed VLIW instructions 
corresponding to the plurality of code sequence of the 
compressed program are stored in an instruction decode 
memory provided separately from the main storage. In 
executing the compressed program, each non-compressed 
VLIW instruction of the corresponding code sequence of the 
compressed program is read from the instruction decode 
memory and executed. In order to determine the code 
sequence, the VLIW instruction having the same structure is 
required to have the same OP code and operand value in 
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each small instruction. With this technique, each VLIW code, and outputs the group of generating OP codes in 

instruction is replaced by a code sequence having a shorter parallel. The small instruction generating unit generates a 

length so that this technique can be considered as a method group of small instructions by combining each of the group 

of compressing VLIW instructions in space. of output OP codes with one of the group of operand 

As compared with the conventional techniques described 5 information sets, and outputs in parallel the group of gen- 

in the first to third reference documents, the conventional erated small instructions to the corresponding one function 

technique described in the fourth reference document is unit group corresponding to each instruction expanding 

expected to generate a smaller program because a series of circuit. 

VLIW instructions containing other instructions in addition According to an embodiment of the processor of this 

to NOP instructions is compressed. However, this technique 10 invention, each instruction field of each compressed long 

described in the fourth reference document discriminates the instruction has a group of NOP numbers each corresponding 

structure of a VLIW instruction while considering even the to each of the group of small instructions represented by the 

operand field in the small instruction used for designating a compressed group code in the compressed, grouped instruc- 

register and the like. Therefore, the number of VLIW tion in each instruction field. Each NOP number represents 

instructions in a program judged as having the same struc- 15 me number of NOP instructions to be executed by one 

ture is not so large, and there is a possibility that the function unit which executed one small instruction corre- 

compression factor becomes not so large. sponding to each NOP number. 

SUMMARY OF THE INVENTION ^ smaU mstruction generating unit of each instruction 

expanding circuit has a group of small instruction generating 

It is therefore an object of the present invention to provide 20 circuits provided in correspondence with a plurality of 

a processor capable of executing a series of VLIW instruc- function units in each function unit group corresponding to 

tions compressed at a higher compression factor. each instruction expanding circuit. 

In order to achieve the above object, a compressed VLIW Each instruction generating circuit includes a small 

instruction used by a VLIW instruction processor of this ^ instruction selector circuit and a control circuit. The small 

invention has a plurality instruction fields corresponding to instruction selector circuit generates one small instruction by 

each of a plurality of function unit groups, and each instruc- combining one of the group of OP codes generated by the 

tion field includes a compressed, grouped instruction repre- group code expanding circuit of each instruction expanding 

sentative of a group of small instructions to be executed in circuit, with respect to the compressed group code in one 

parallel by each function unit corresponding to each instruc- ^ grouped instruction in one long instruction supplied from the 

tion field, the compressed, grouped instruction being repre- long instruction supply circuit, with one of the group of 

sented by an amount of information smaller than an amount operand information sets in the one grouped instruction, and 

of information of the group of small instructions, supplies the generated one small instruction to one function 

The VLIW instruction processor of this invention unit corresponding to each small instruction generating 
includes a plurality of function units divided into a plurality 35 circuit. After the one small instruction is supplied, NOP 
of function unit groups. The processor includes an instruc- instructions same in number as that designated by one of the 
tion expanding unit for expanding a VLIW instruction, the group of NOP numbers are sequentially supplied to the 
instruction expanding unit has a plurality of instruction function unit corresponding to each small instruction gen- 
expanding circuits operating in parallel and provided for erating circuit. The control circuit inhibits each small 
each of the plurality of function unit groups. 4Q instruction generating circuit to supply a small instruction to 

Each mstruction expanding circuit expands a compressed, the corresponding function unit, the small instruction being 
grouped instruction in a long instruction supplied from an supplied from each instruction expanding circuit after the 
instruction supply unit, the compressed, grouped instruction group of OP codes and containing one of a group of 
contained in one instruction field corresponding to one succeeding OP codes generated from a long instruction 
function unit group corresponding to each instruction 45 succeeding the one long instruction, until the NOP instruc- 
expanding circuit, generates a group of small instructions lions same in number as that represented by the one NOP 
each to be executed by each function unit of the correspond- number are supplied completely, 
ing one function unit, and outputs in parallel the generated 
group of small instructions to the corresponding one func- 
tion unit group. 50 FIG. 1 is a schematic diagram showing the structure of a 

More specifically, the compressed, grouped instruction VLIW instruction processor according to an embodiment of 

includes a group code representative of a group of OP codes the invention. 

necessary for each of the group of small instructions repre- pjc 2A is a diagram showing the format of a VLIW 

sented by the grouped instruction, the compressed, grouped instruction not compressed. 

instruction being represented by an amount of information 55 mQ 2B ^ a & showi me format of a yrjw 

smaller than an amount of ^formation of the group of OP compressed in time. 

codes, and a group of operand mformatioD sets representa- M ^ , f , ^ * « , 

tive of at least one operand necessary for one of the group HG - 2C 15 a ^8™ s u howin S f ormat of a 

of small instructions. instruction compressed both in time and space. 

Each instruction expanding circuit has a group code 60 mG 2D * a showin g * e f f mat of a . VLIW 

expanding circuit and a small instruction generating unit. instruction containing an instruction for preloading an 

The group code expanding circuit expands a compressed instruction table. 

group code in a long instruction supplied from the instruc- FIG. 3 is a circuit diagram briefly showing an instruction 

tion supply unit, the compressed group code being contained expanding circuit to be used by the processor shown in FIG. 

in an instruction field corresponding to each instruction 65 1- 

expanding circuit, generates a group of OP codes necessary FIG. 4 is a circuit diagram briefly showing a group code 

for the group of small instructions represented by the group expanding circuit to be used by the circuit shown in FIG. 3. 
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FIG. 5 is a circuit diagram briefly showing a small 
instruction generating circuit to be used by the circuit shown 
in FIG. 4. 

FIG. 6 is a circuit diagram briefly showing another 
example of the group code expanding circuit to be used by 
the VOW processor of this invention. 

FIG. 7 is a circuit diagram briefly showing still another 
example of the group code expanding circuit to be used by 
the VUW processor of this invention. 

FIG. 8 is a diagram showing an example of a FORTRAN 
program illustrating the processes to be executed by the 
processor shown in FIG. 1. 

FIG. 9 is a diagram showing an example of the FOR- 
TRAN program shown in FIG. 8 written by an assembly 
language for a virtual processor 

FIG. 10 is a diagram showing an example of a program 
obtained by scheduling the assembly language program 
shown in FIG. 9. 

FIG. 11 is a diagram showing a program expressed by a 
series of long instructions having the format shown in FIG. 
2A, the program executing the processes expressed by a 
series of instructions shown in FIG. 10. 

FIG. 12Ais a diagram showing an example of a load/store 
instruction table. 

FIG. 12B is a diagram showing an example of a floating 
point calculation instruction table. 

FIG. 12C is a diagram showing an example of a fixed 
point calculation instruction table. 

FIG. 12D is a diagram showing an example of the 
program shown in FIG. 11 and expressed by a series of long 
instructions having the format shown in FIG. 2C. 

FIGS. 13A, 13B and 13 C are diagrams illustrating the 
execution of the program shown in FIG. 12D. 

DETAILED DESCRIPTION OF THE 
PREFERRED EMBODIMENTS 

An embodiment of a VLIW instruction processor of this 
invention will be described in detail with reference to the 
accompanying drawings. In the following, identical or simi- 
lar components are represented by using same reference 
numerals. 

(1) Outline of structure 

Referring to FIG. 1, a VLIW processor system has a main 
storage 100 and a VLIW processor 200 connected to the 
main storage. The processor 200 has a plurality of function 
units which execute in parallel a plurality of small instruc- 
tions in one VLIW instruction. In this embodiment, the 
function units include: load/store (LD/ST) units 204-1 and 
204-2 for executing a small instruction of load or store; 
floating point calculation units 205-1 and 205-2 for execut- 
ing a small instruction of floating point calculation; fixed 
point calculation units 206-1 and 206-2 for executing a small 
instruction of fixed point calculation; and a branch unit 207 
for executing a small instruction of branch. In the following 
description, a small instruction of load and a small instruc- 
tion of store are also called simply a load instruction and a 
store instruction, and these small instructions are also called 
simply a load/store instruction or an L/S or LD/ST instruc- 
tion. Similarly, a small instruction of floating point calcu- 
lation is also called simply a floating point calculation 
instruction or an FL instruction. Similarly, a small instruc- 
tion of fixed point calculation is also called simply a fixed 
point calculation instruction or an FX instruction. Similarly, 
a small instruction of branch is also called simply a branch 
instruction. 
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The main storage 100 stores therein programs constituted 
of a plurality of compressed VLIW instructions. An instruc- 
tion fetch unit 202 sequentially fetches compressed VLIW 
instructions from an instruction cache 201 or from the main 

5 storage 100. An instruction expanding unit 300 expands 
each fetched, compressed VOW instruction, and generates 
a non-compressed VLIW instruction composed of a plurality 
of small instructions. This instruction expanding unit is a 
circuit characteristic to this embodiment. 

10 In this embodiment, a plurality of VLIW instructions are 
compressed in time to reduce the number of NOP instruc- 
tions contained in these VLIW instructions. A VLIW instruc- 
tion obtained in this manner is called a time -compressed 
VLIW instruction. Each time-compressed VLIW instruction 

lf is further compressed in space. For this space compression, 
a plurality of small instructions constituting each time- 
compressed VLIW instruction are divided into a plurality of 
groups, and a plurality of small instructions of each group 
are compressed to generate a new compressed, grouped 

2Q instruction. In this manner, each time-compressed VLIW 
instruction is converted into a plurality of compressed, 
grouped instructions. A VLIW instruction after this conver- 
sion is also called a space-compressed VLIW instruction. 
The instruction expanding unit 300 generates a non- 
25 compressed VLIW instruction by expanding each space- 
compressed VLIW instruction. There are provided a plural- 
ity of instruction expanding circuits 300a, 300b and 300c 
each generating a non-compressed VLIW instruction by 
expanding in time and space one of a plurality of 

30 compressed, grouped instructions constituting the space- 
compressed VLIW instruction. In this embodiment, the 
branch instruction is compressed in time but not in space. An 
instruction expanding circuit 300^ is provided for time- 
expanding such a time-compressed branch instruction. Only 

35 in this point, the instruction expanding circuit 300d is 
different from the other instruction expanding circuits 300a, 
330b and 300c. However, in the following description, these 
instruction expanding circuits 300a-300rf will not be dis- 
criminated therebetween where appropriate. 

40 A decode unit 203 analyzes each of a plurality of small 
instructions constituting the non-compressed VLIW instruc- 
tion supplied from the instruction expanding unit 300 via 
one of signal lines 405-408, and supplies the analyzed small 
instruction to a corresponding one of the function units 

45 including the LD/ST units 204-1 and 204-2, floating point 
calculation units 205-1 and 205-2, fixed point calculation 
units 206-1 and 206-2, and branch unit 207. This decode 
circuit 203 is well known. 

Each of the function units 204-1, 204-2, 205-1, 205-2, 

50 206-1, 206-2, and 207 performs an operation designated by 
the small instruction supplied from the decode unit 203, by 
using a register group (hereinafter called a GR/FR register 
group) 208 and an internal bus 416. The GR/FR register 
group has a plurality of general registers (GR) and a plurality 

55 of floating point registers (FR). Of these function units, the 
LD/ST unit 204-1 or 204-2 loads or stores data by control- 
ling the GR/FR register group 208 and a data cache 209 via 
the internal bus 416, when it receives a small instruction for 
requesting to load data stored in the main storage 100 or 

60 when it receives a small instruction for requesting to write 
data in the main storage 100. The floating point calculation 
unit 205-1 or 205-2 performs a register calculation by using 
the GR/FR register group 208 when it receives a floating 
point calculation small instruction. The fixed point calcula- 

65 tion unit 206-1 or 206-2 performs a register calculation by 
using the GR/FR register group 208 when it receives a fixed 
point calculation small instruction. The branch unit 207 
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executes a branch operation designated by a branch instruc- 
tion when it receives the branch instruction. If the branch has 
a different branch direction from a predicted one, a notice to 
the effect that the branch direction is different is supplied via 
a signal line 415 to a corresponding one of the instruction 
expanding circuits 300a-300<i 

(2) Compression method of VLIW instruction 

FIG. 2 A shows the format of a non -compressed VLIW 
instruction 100 used by the embodiment. This instruction 
has seven instruction fields 1, 2, . . . , 6 and 7 for setting small 
instructions each being executed by one of the seven func- 
tion units shown in FIG. 1. A small instruction set to each 
instruction field is a valid instruction or a NOP instruction. 
In this embodiment, these instruction fields 1-7 are assigned 
with seven small instructions to be executed by the LD/ST 
units 204-1 and 204-2, floating point calculation units 205-1 
and 205-2, fixed point calculation units 206-1 and 206-2, 
and branch unit 207. Each valid small instruction has a 
sub-field 10 for setting an OP code (OPC) representative of 
the kind of operation and sub-fields 11a and 116 for setting 
operands OPD1 and OPD2 used for the operation indicated 
by the small instruction. 

FIG. 2B shows the format of a VLIW instruction 100a 
obtained when a plurality of VLIW instructions having the 
format of FIG. 2 A are compressed in time. A valid small 
instruction or a NOP instruction is set to each of instruction 
fields la, 2a, , . . , 6a and la of this VLIW instruction. Each 
small instruction has an additional NOP number sub-field 12 
which is not provided in the VLIW instruction shown in 
FIG. 2A. This NOP number sub-field is provided even if the 
small instruction is a NOP instruction. The NOP sub -field 12 
of each small instruction is set with the number of NOP 
instructions which are to be used, after the execution of this 
small instruction and before the execution of a next valid 
small instruction contained in the same instruction field of 
the succeeding VLIW instruction. Namely, the number of 
NOP instructions corresponds to the number of execution 
delay cycles of the next valid small instruction. 

Specifically, if there are NOP instructions in the same 
instruction fields of the succeeding non-compressed VLIW 
instructions, the number of NOP instructions is stored in the 
NOP number sub-field of the valid small instruction in the 
VLIW instruction immediately before the VLIW instruc- 
tions containing NOP instructions. In other words, in place 
of a plurality of NOP instructions contained in different 
VLIW instructions, one NOP number is set and the NOP 
instructions are deleted from the VLIW instructions. By 
using the NOP number, small instructions in the instruction 
fields of consecutive VLIW instructions are packed and the 
number of VLIW instructions can be reduced. In this 
embodiment, therefore, this process is called time compres- 
sioa 

FIG. 2C shows the format of a' VLIW instruction 1006 
obtained by compressing in space one time-compressed 
VLIW instruction having the format shown in FIG. 2B. A 
grouped instruction is set to each of instruction fields 21, 34, 
56 and la of this VLIW instruction 1006. For this space 
compression, a plurality of small instructions in one time- 
compressed VLIW instruction is divided into a plurality of 
groups each constituted of a plurality of small instructions, 
and a plurality of small instructions contained in each group 
are compressed to generate a grouped instruction represen- 
tative of a group of small instructions. Specifically, in place 
of OP codes of each group, a group code is set which 
unanimously determines a combination of the OP codes. 
This group code has a length shorter than the total length of 
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the OP codes. Therefore, this group code is a compressed 
code of the OP codes of small instructioas. The operand 
information of these small instructions.is set in the grouped 
instruction. 

In this manner, a grouped instruction for a group of small 
instructions is generated, the grouped instruction including 
one compressed group code and a plurality of information 
sets of the small instructions. In addition, the length of the 
NOP number sub-field of each of a plurality of small 
instructions of the grouped instruction is shortened properly. 
Therefore, the length of a VLIW instruction having a plu- 
rality of these grouped instructions becomes shorter than the 
VLIW instruction compressed only in time and not in space. 
This newly generated VLIW instruction is also called a 
space-compressed VLIW instruction where applicable. 

In this embodiment, for example, the two small instruc- 
tions in the load/store instruction fields la and 2a shown in 
FIG. 2B are compressed to generate a new grouped instruc- 
tion (FIG. 2C). For example, the field 21 of the grouped 
instruction shown in FIG. 2C includes: a sub-field 10A for 
setting a group code obtained by compressing the OP codes 
of the small instructions la and 2a; an operand sub-field 11A 
for setting the operands 11a and 116 of the small instruction 
la; a sub-field 12A for setting the NOP number of the small 
instruction la; an operand sub-field 11B for setting the 
operands 11a and lib of the small instruction 2a; and a 
sub-field 12B for setting the NOP number of the small 
instruction 2a. 

The field 21 shown in FIG. 2C is set with a grouped 
instruction generated in the above manner. Similarly, two 
small instructions in the floating point calculation instruc- 
tion fields 3a and 4a are compressed in space to generate a 
new grouped instruction. A field 34 shown in FIG. 2C is set 
with a grouped instruction generated in this manner. 
Similarly, two small instructions in the fixed point calcula- 
tion instruction fields 5a and 6a are compressed in space to 
generate a new grouped instruction. Afield 56 shown in FIG. 
2C is set with a grouped instruction generated in this 
manner. 

In this embodiment, as will be later detailed, at least one 
instruction table is used. This instruction table stores a 
plurality combination of OP codes of two small instructions 
and a compressed group code representative of each com- 
bination. The group code is used as a pointer to the location, 
in the instruction table, of each OP code combination of two 
small instructions. This group code is used to refer to the 
instruction table when two small instructions of the time- 
compressed VLIW instruction are compressed in space. The 
group code is set to the sub-field 10a of each of the fields 21, 
34 and 56. 

In this embodiment, in the space -compressed VLIW 1006 
shown in FIG. 2C, the branch instruction 7a has no other 
small instructions to be compressed in space. Therefore, the 
branch instruction lb is not compressed in space, but the 
branch instruction lb itself shown in FIG. 2B is set to the 
space-compressed VLIW instruction 1006 shown in FIG. 
2C. 

In this embodiment, a first program written by non- 
compressed VLIW instructions shown in FIG. 2 A is gener- 
ated from a source program. A second program written by 
the time-compressed VLIW instructions shown in FIG. 2B 
is generated by time-compressing the first program. A third 
program written by the space-compressed VLIW instruc- 
tions shown in FIG. 2C is generated by space-compressing 
the second program. This third program is used as a program 
to be executed. The compression sequence will be described 
by using a particular program. 
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FIG. 8 shows a benchmark program which is obtained by 
expanding a famous Rivermore kernel — kernel loop No. 1 
by a fourfold. It is assumed that a program shown in FIG. 9 
is obtained by coding this benchmark program by the 
assembler language of the processor structured as shown io 
FIG. 1. A number enclosed between parenthesis at the left 
side of FIG. 9 is a serial number of each instruction of the 
program. 

In this program shown in FIG. 9, LD of the first to fourth 
instructions represents a load instruction. With this load 
instruction, data in the main storage at an address (an 
address designated by a (K+10)th element of an array Z or 
by a variable T, R or Q) designated by the second operand 
of this instruction is loaded into a register (the 26th, 1st, 2nd 
or 3rd floating point register FR26, FR1, FR2 or FR3) 
designated by the first operand. 

MOVE of the fifth instruction is a data move instruction. 
With this data move instruction, the contents of the register 
(the 26th floating point register FR26) represented by the 
second operand are copied to the register (the 4th floating 
point register (FR4)) represented by the first operand. 

FMULof the 7th, 11th, 15th, 16th, 19th, 23rd, 27th, 31st, 
32nd and 35th instructions is a floating point multiplication 
instruction. With this instruction, a product of the contents of 
the two floating point registers designated by the second and 
third operands is stored in the floating point register desig- 
nated by the first operand. 

FADD of the 12th, 20th, 28th and 36th is a floating point 
addition instruction. With this instruction, an addition of the 
contents of the two floating point registers designated by the 
second and third operands is stored in the floating point 
register designated by the first operand. 

LDU of the 10th, 14th, 18th, 22nd, 26th, 30th and 34th is 
a load up instruction. With this instruction, similar to the 
load instruction, data in the main storage at the address 
designated by the second operand is loaded in the register 
designated by the first operand, and different from the load 
instruction, after the data is loaded, the value of a register 
(not shown) holding the address of the second operand is 
incremented by a designated amount of addresses. 

With the 38th branch instruction, the operation is 
branched to the instruction (fifth instruction) affixed with a 
label LOOP as many repetition times as set in the 0th genera] 
register (in the case of this embodiment). In FIG. 10, an 
instruction to preset the repetition times to the general 
register is omitted. The program shown in FIG. 9 executes 
the first to fourth instructions and thereafter repetitively 
executes the instructions from the fifth instruction affixed 
with the label LOOP to the last branch instruction. 50 

A series of instructions shown in FIG. 9 is scheduled 
properly to obtain a series of instructions shown in FIG. 10. 
This scheduling is performed so that the processes of the 
series of instructions (1) to (38) shown in FIG. 9 are 
executed by the processor shown in FIG. 1 as a series of 55 
non-compressed VLIW instructions having the format 
shown in FIG. 2A and that' the time required for the 
execution completion of the instructions is shortened. It is 
assumed that the number of machine cycles (load latency) is 
"6" which is required for the data to become usable after the eo 
processor shown in FIG. 1 executes the load instruction and 
that the number of machine cycles (latency) is "2" which is 
required for the calculation results to become usable after a 
calculation instruction such as a multiplication instruction 
and an addition instruction is executed. 

In FIG. 10, each row indicates one non-compressed 
VLIW instruction. The VLIW instruction at an upper posi- 
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tion is executed earlier. Information contained in each small 
instruction is in conformity with the format shown in FIG. 
2 A. In the instruction series shown in FIG. 9, no fixed point 
calculation is contained so that all fields of the two fixed 
point columns of VLIW instructions shown in FIG. 10 are 
filled with NOP instructions. 

The VLIW instructions executing the processes requested 
by the first to fourth instructions in the area above the double 
line in FIG. 9 are executed only once. The VLIW instruction 
series in the area below the double line executing the 
processes requested by the fifth and following instructions 
are executed repetitively. This discrimination by the double 
line is also applied to FIGS. 11 and 13. As seen from FIG. 
10, he first to fourth instructions shown in FIG. 9 are 
executed by the first to sixth VOW instructions shown in 
FIG. 10, whereas the fifth and following instructions shown 
in FIG. 9 are executed by the seventh and following VOW 
instructions shown in FIG. 10. 

FIG. 11 shows a series of VLIW instructions obtained by 
time-compressing the VLIW instructions shown in FIG. 10. 
Each of the VOW instructions shown in FIG. 11 has the 
format shown in FIG. 2B. A NOP number sub-field is added 
to each small instruction in each VOW instruction shown in 
FIG. 10. The number of NOP instructions after each small 
instruction is set to this NOP number sub-field, and the 
succeeding NOP instructions are deleted to thereby obtain 
the VOW instruction shown in FIG. 11. This NOP number 
sub-field is represented in FIG. 11 by parenthesis and a 
number between them. For example, the third to sixth NOP 
instructions in the left two columns of FIG. 10 are deleted, 
and the number "4** of deleted NOP instructions is set to the 
NOP number sub -fie Ids of the two second load instructions 
in FIG. 11. 

In this embodiment, the NOP number sub -field is assumed 
as having a 3-bit length. Therefore, the number of succeed- 
ing NOP instructions, only "Q" to U T\ can be set to one NOP 
number sub -field, and all these succeeding NOP instructions 
can be deleted. If the number of succeeding NOP instruc- 
tions is "8" or larger, a value "7" is set to the NOP number 
sub -field of this small instruction, and seven NOP instruc- 
tions among the succeeding NOP instructions are deleted. 
The remaining NOP instructions not deleted are set, at a 
maximum of "1", to the NOP number sub-field of the next 
VLIW instruction. If the remaining NOP instructions is in 
excess of "7", the similar process as above is repeated. 

Even if the number of succeeding NOP instructions is "7" 
or smaller, all NOP instructions are not deleted in some 
cases. For example, the small instructions in the first to sixth 
VLIW instructions for the first floating point calculation are 
all NOP instructions and the total number of NOP instruc- 
tions is "6". The small instructions in the first to sixth VLIW 
instructions for the second floating point calculation are also 
all NOP instructions and the total number is 

Although the total number of these NOP instructions is 
smaller than "7", the first to sixth NOP instructions for the 
floating point calculation shown in FIG. 10 are distributed 
into two VLIW instructions as shown in FIG. 11 because the 
first and second VLIW instructions of FIG. 11 are essential 
for executing the processes requested by the two VLIW 
instructions as stated earlier. In this example, the first NOP 
number field is set to "4" among the six NOP instructions of 
the first to sixth VLIW instructions for the first floating point 
calculation shown in FIG. 10. In this example, "4" is set to 
the NOP number sub-field of the first VLIW instruction and 
"0" is set to the NOP number sub-field of the second VLIW 
instruction. This setting depends on an algorithm adopted in 
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this embodiment. If another algorithm is adopted, for (instruction fields) occurs because of time compression, one 

example, "2" may be set for each of the first and second function unit is assigned a valid small instruction whereas 

VLIW instructions. another function unit is not assigned a valid small instruc- 

As above, even if the total number of consecutive NOP tion. In the VLIW computer, small instructions for all 

instructions in the same fields of different VLIW instructions 5 function units are fetched collectively so that it is not 

is smaller than "T, the NOP instructions are not deleted possible to fetch only valid small instructions in one VUW 

more than the total number of time-compressed VLIW instruction. It is therefore necessary to assign a function unit 

instructions necessary for the execution of the processes having no valid instruction with an invalid small 

required by the valid small instructions in the time- instruction. If the NOP instruction is used for this assigned 

compressed VLIW instructions containing such NOP 10 mva lid smaU ir^truction, the execution among function units 

instructions. shifts, particularly in the case of repetitive processes, 

Each of the time-compressed VUW instruction shown in because the NOP instruction requires one cycle execution 

FIG. 11 is space-compressed to obtain a series of instruc- ume From this reason, not the NOP instruction but the 

tions shown in FIG. 12. For this space compression, a invalid instruction is used. The invalid instruction is inserted 

plurality of small instructions constituting each VLIW 15 ^ i n the foUowing. Compiled program codes are divided at 

instruction shown in FIG. 11 are divided into a plurality of a branch destination of the branch instruction, i.e., at the 

groups, and a plurality of small instructions of each group is labeled location, and time compression is conducted for 

space-compressed to thereby space-compress the time- each co ii ec tion of divided codes (this collection is called 

compressed VLIW instruction. In this embodiment, each hereinafter a block). If there is a usage difference between 

time-compressed VLIW instruction is divided into small M fu nction units used in each block, invalid instructions are 

grouped instructions each composed of two small added to t h e end of an instruction field so that the number of 

instructions, including a pair of ITS instructions, a pair of FL instructions becomes the same as the other instruction field 

instructions, and a pair of FX instructions, and into one small having the largest valid instructions, 

grouped instruction composed of the branch instruction. In [q nG n ^ codes for ^ ^ ^ mstruction 

addition, an instruction table is prepared which stores a M m pp £ inyalid mstructions are inserted, 

plurality combination of OP codes the small instructions of m ^ embodiment mat me effective small 

each group can take, and a compressed group code repre- ^ m ^ ^ Tq ^ instruction fie]d shown m 

sentative ot each combination is used. l*c group code is F[G _ ^ bmich 

instruction itself contained in the 

used as a pointer to the location, in the mstruction table, 01 t . , ,™ m , . , - . XT , 4k . 

. , . . . „ ' „ . 4 . ' time-compressed VLIW instruction is set. Namely, this 

each OP code combination of two small instructions. 30 branch fe not compressed m space although it is 

FIGS. 12A, 12B and 12C show mstruction tables 30A, compre ssed in time. 
30B and 30C corresponding to the L/S instruction group, FL 

- . j ™ . ° tx. ' • t. 4- In this embodiment, therefore, the time-compressed 

instruction group and FX instruction group. This instruction . . ' - ™ im a 1 i-« 

* ui„ Z, tUa • c , t __ t - „ t' U \/ia\ tu» t /c VLIW instruction series shown in FIG. 12 D and a plurality 

table, for example, the instruction table 30A for the L/S . „ *1„ J 

instruction group, is formed by deriving different combina- 35 of infraction tables 30A^J0C shown m FIGS. 12A-12C are 

tions of Opcodes contained in the two time-compressed IVS used for J™* 0 * *» P 10 ^ sho ™ m "A* P ,Tr 

instruction columns. The instruction tables for the FL 10 th^ program the instrucUon Ubles 30A-30C 

* j j. j ™ - . r j • ii are read from the main storage 100 and supplied to the 
instruction group and FX mstruction group are formed in the . . ... . f rtA i* ft n v u 

JL oc ,u mro Ac ™- a.™ ctpo i-> a i^n mstruction expanding circuits 300a, 3006, and 300c which 

same manner as above. As seen from FIGS. 12A, 12a and - , * j • 4 a - ^ j - * 

iio .u • * * ui an * f *l t /o • * *• , Iben expand the grouped mstructions. lo this end, an lnstruc- 

12C, the mstruction table 30A for the L/S instruction group 40 . - * . • ' * 

, - , f . . - . .. . . , lft n tion for loading the mstruction tables is executed. This 

is composed of only four entries, the instruction table 30B . .„ » , A , 

c *u • * *■ - i r x; * • ««j mstruction will be later described, 
for the rL instruction group is composed or live entries, and 

the instruction table 30C for the FX instruction group is P) Expansion of VLIW instructions, 
composed of one entry of a combination of NOP and NOP. The VLIW instruction series 40 and the instruction tables 
In this embodiment, a row number written at the side of each 45 30A-30C are stored in the main storage 100. Referring to 
instruction table is used as the group code which stands for FIG. 1, the instruction fetch unit 202 sequentially fetches the 
the combination of OP codes at each row. space-compressed VLIW instruction series 40 (FIG. 12D) 
FIG. 12 shows the space-compressed VLIW mstructions from the instruction cache 201 or the main storage 100. The 
generated from the VLIW instructions shown in FIG. 11 by instruction expanding unit 300 expands the fetched instruc- 
ting the group codes. The group code lOAfor each grouped 50 &ons by using the instruction tables 30A-30C, and supplies 
instruction is written at the leftmost column of each L/S the expanded instructions to the decode unit 203. In this 
instruction group. The group code for the FL instruction embodiment, the instruction tables 30A-30C are loaded in 
group is written in the same manner. The seventh L/S advance from the main storage 100 by the instruction 
instruction group is constituted of NOP instructions. The expanding unit 300 by using the method to be described 
reason why the time-compressed VLIW instructions contain 55 l ater - 

NOP instructions is the same as described with the time- The instruction expanding unit 300 expands each of a 
compressed VLIW instructions shown in FIG. 11 . The group plurality of grouped instructions contained in each time- and 
code for the tenth and following L/S instruction group is space-compressed VUW instruction into a plurality of small 
"FF\ A small instruction having this code is an invalid instructions not compressed both in time and space. The 
instruction which is not executed although it is fetched. In 60 instruction expanding unit 300 has a plurality of instruction 
the following, such a small instruction is called an invalid expanding circuits for expanding each of a plurality of 
instruction or simply an FF instruction. The NOP instruction grouped instructions contained in each time- and space- 
is executed in one machine cycle. Therefore, the invalid compressed VLIW instruction. In this embodiment, there are 
instruction is different from the NOP instruction in that the provided an instruction expanding circuit 300a for LD/ST 
former is not executed. 65 instructions, an instruction expanding circuit 300Z? for float- 
The reason why such an invalid instruction is used is as ing point instructions (FL instructions), an instruction 
follows. If a usage difference between function units expanding circuit 300c for fixed point instmctions (FX 
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instructions), and an instruction expanding circuit 300c for 
the branch instruction. Of these instruction expanding 
circuits, the instruction expanding circuits 300a-300c 
expand an instruction compressed both in time nor space, 
whereas the instruction expanding circuit 300a" expands the 
branch instruction not compressed in space but compressed 
in time. The instruction expanding circuits 300a-300c have 
the same structure with respect to the expansion of an 
instruction compressed both in time and space. Since the 
instruction expanding circuit 3>00d expands the branch 
instruction not compressed in space but compressed in time, 
it has the structure more simple than those of the instruction 
expanding circuits 300a-300c. The instruction fetch unit 
202 has a circuit (not shown) for distributing in parallel a 
plurality of grouped instructions in a fetched instruction 
compressed both in time and space to the instruction 
expanding circuits 300a-300o* via a line 403. In this 
embodiment, the branch instruction is fetched in accordance 
with branch prediction. A circuit for such prediction, a 
circuit for activating to fetch a branch destination instruction 
based on the prediction, and other circuits are not shown and 
omitted for simplicity purpose. 

FIG. 3 shows an example of the instruction expanding 
circuit 300a of the processor shown in FIG. 1. Referring to 
FIG. 3, the instruction expanding circuit 300a has a plurality 
of queues each for storing a corresponding one of a plurality 
of sub-fields of a compressed, grouped instruction supplied 
via the line 403 from the instruction fetch unit 202. More 
particularly, a group code queue 304 receives the group code 
contained in the sub -fie Id 10a of the compressed, grouped 
instruction via a line 403-1, and an operand queue 305-1 
receives a plurality of operands OPD1 and OPD2 contained 
in the sub -field 11 A of the compressed, grouped instruction 
via a line 403-2. ANOP number queue 306-1 receives a NOP 
number contained in the sub-field 12A of the compressed, 
grouped instruction via a line 403-4, and an operand queue 
305-2 receives a plurality of operands OPD1 and OPD2 
contained in the sub-field 11B of the compressed, grouped 
instruction via a line 403-3. 2. A NOP number queue 306-2 
receives a NOP number contained in the sub-field 12B of the 
compressed instruction via a line 403-5. These queues are 
first-in first-out queues which first output first received 
information. A comparator 308 and an FF register 309 are 
used for inhibiting the group code queue 304 or the like to 
receive information contained in small instructions of a 
grouped instruction supplied from the instruction fetch unit 
202 if this grouped instruction is the invalid instruction (FF 
instruction). 

The instruction expanding circuit 300a has a group code 
expanding loping circuit 301 which expands the group code 
output from the group code queue 304 into a corresponding 
combination of OP codes. The instruction expanding circuit 
300a has non-compressed OP code queues 302-1 and 302-2 
for storing a combination of non-compressed OP codes read 
from the group code expanding circuit 301, and has a small 
instruction generating unit 33 for generating a plurality of 
small instructions in accordance with a plurality of OP codes 
output from the OP code queues 302-1 and 302-2 and 
outputting them to a plurality of function units correspond- 
ing to the instruction expanding circuit 300a. This small 
instruction generating unit 33 has instruction generating 
circuits 303-1 and 303-2 provided in correspondence with 
the OP code queues 302-1 and 302-2. The small instruction 
generating circuit 303-1 either outputs a small instruction at 
a line 501 composed of the OP code output from the OP code 
queue 303-a and the operand output from the operand queue 
305-1, or a NOP instruction to the corresponding function 
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unit, in accordance with the NOP number supplied from the 
NOP number queue 306-1. After a small instruction on the 
line 50-1 is selected, it outputs NOP instructions during the 
period of cycles equal to the NOP number contained in the 
small instruction. In this manner, NOP instructions are 
generated as many as the NOP number contained in the 
small instruction. In this manner, the small instruction 
generating circuit time-expands the small instruction 
obtained through space expansion by the group code 
expanding circuit, to thereby generate a plurality of small 
instructions compressed neither in time nor in space. The 
other small instruction generating circuit 303-2 operates in 
the similar manner as above, in cooperation with the OP 
code queue 302-2, operand queue 305-2, and NOP number 
queue 306-2. 

The operation of the processor shown in FIG. 1 will be 
further described in detail. As shown in FIG. 4, the group 
code expanding circuit 301a has an instruction table 
memory 307 for storing the instruction table 30 A In this 
embodiment, it is assumed that this memory has a capacity 
capable of storing the whole of the instruction table 30 A 
This assumption is also applied to other instruction, expand- 
ing circuits. Prior to executing the program shown in FIG. 
12D, an instruction is executed which instruction is used for 
loading the instruction tables 30A, 30B and 30C shown in 
FIGS. 12A-12C in the instruction table memories 307 of the 
instruction expanding circuits 300a, 300& and 300c. 

This preload instruction may be set in place of one 
grouped instruction of the time- and space-compressed 
VLIW instruction lOOfc shown in FIG. 2C. FIG. 2D shows 
a long instruction 100c containing a preload instruction 16 
for the instruction expanding circuit 300a. In this long 
instruction 100c, the preload instruction 16 is contained in 
the instruction field 21 of the LD/ST group. The operand 
sub-fields of this preload instruction are set with the start 
address 17b of an area (block) in the main storage to be 
loaded (in this example, the start address (sub-field 17) of 
the instruction table 30A) and the number (sub-field 18) of 
entries to be loaded. The sum of the length of the OP code 
16 of this preload instruction and the length of the operand 
sub-fields 17 and 18, i.e., the length of the preload 
instruction, is the same as that of one grouped instruction 
shown in FIG. 2C. In the following description of processing 
this preload small instruction, it is considered as the grouped 
instruction in some cases. NOP instructions are also set to 
the other fields of this long instruction 100c. In this 
embodiment, however, the other fields may be set with other 
grouped instructions or preload instructions for other 
instruction expanding circuits. 

Prior to fetching the instruction series 40 show in FIG. 
12D, the instruction fetch unit 202 shown in FIG. 1 fetches 
the long instruction 100c (FIG. 2D) containing the preload 
instruction used by the instruction expanding circuit 300a 
and other long instructions containing the preload instruc- 
tions for the other instruction expanding circuits. For 
example, as the long instruction 100c containing the preload 
instruction for the instruction table 30A is fetched by the 
instruction fetch unit 202, the contents of a plurality of 
grouped instruction fields of this long instruction are trans- 
ferred to the instruction expanding circuits 300a, 3006, 300c 
and 300a* of the instruction expanding unit 300. Therefore, 
the instruction expanding circuits 300/>-300d become ready 
for expanding other grouped instructions. Along instruction 
filled with all preload instructions may be used. In this case, 
if this long instruction is fetched once, preloads for all 
function units can be completed by one process. 

The instruction expanding circuit, for example, the 
instruction expanding circuit 300a shown in FIG. 3, has the 
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group code "FF* of the invalid instruction (FF instruction) signals 418-2, 418-3 and 418-4 from the other instruction 

stored in the FF register 309. When the preload small expanding circuits 300J>, 300c and 300d by an OR gate 518, 

instruction is supplied via the line 403 from the instruction and an output of the OR gate is sent via a line 418 to the 

fetch unit 202, the comparator 308 compares "FF" in the instruction fetch unit 202. Therefore, any queue in each 

register 309 with the OP code of the grouped instruction 5 instruction expanding circuit will not lose information in a 

supplied from a line 403-1. If this comparison does not small instruction supplied from the instruction fetch unit 

detect a coincidence, the information in this grouped instruc- 202. 

tion is permitted to be stored in the queues 304, 305-1, The group code at the top of the group code queue 304 is 

306-1, 305-2 and 306-2. Namely, the two operands in the input via the line 500 to the group code expanding circuit 

transferred preload instruction are distributed to and stored 10 301. In this group code expanding circuit 301, the instruc- 

in the first operand queue 305-1, first NOP number queue tion table memory 307 outputs a plurality of OP codes 

306-1, second operand queue 306-2, and second NOP num- among the OP code group stored in the instruction table 30A 

ber queue 306-2 via lines 403-2 - 403-5. The OP code of this at the location designated by the input group code. An output 

instruction is transferred to and stored in the group code of the comparator 807 is supplied via an inverter 804 to 

queue 304 via a line 510. Assuming that this queue is empty 15 switches 806-1 and 806-2. If the output of the comparator 

before this time, the OP code of this instruction is immedi- 807 is "0", i.e., if the OP code supplied via the line 500 does 

ately transferred via a line 500 to the group code expanding not correspond to the preload instruction, these switches 

circuit 301a, and the two operands are transferred from the operate to supply the plurality of OP codes read from the 

queues to the group code expanding circuits 301 via a line instruction table memory 307 to the OP code queues 302-1 

506. M and 302-2 via lines 504-1 and 504-2. The output of the 

Referring to FIG. 4, a preload instruction register 808 of comparator 807 is also supplied via a line 505 to the small 

each group code expanding circuit 301 (e.g., 301a) stores in instruction generating circuit 303-1. If the output of the 

advance the OP code of the preload instruction. When the comparator 807 is "1", i.e., if the OP code supplied via the 

OP code is transferred from the group code queue via the line 500 corresponds to the preload instruction, the small 

line 500, it is compared with the OP code in the preload 25 instruction generating circuit 303-1 is notified that the output 

instruction register 808 by a comparator 807 to judge of the group code expanding circuit 301 is invalid, 

whether the transferred OP code corresponds to the preload The OP code queues 302-1 and 302-2 are first-in first -out 

instruction. queues. Of a plurality of OP codes input to these queues, the 

If the transferred OP code is coincident with the OP code first input OP codes are stored at the top locations of the 

in the preload instruction register 808, an output 810 of the 30 queues and output to the lines 501-1 and 501-2. The OP code 

comparator 807 becomes "V\ An address register 801 stores output from the OP code queue 302-1 to the line 501-1 and 

the start address of the instruction table to be preloaded, and the operand group stored in the top locations of the operand 

an entry number register 802 stores the effective entry queue 305-1 are combined to restore one small instruction, 

number of the instruction table supplied via the line 506. A This restored small instruction is supplied via a line 50-1 to 

memory access circuit 310 is activated by the output 810 35 the small instruction generating circuit 303-1, 

from the comparator 807 to access the main storage 100 at As shown in FIG. 5, the small instruction generating 

the location starting from the address stored in the address circuit 303-1 has a selector 600 and a circuit 620 for 

register 801 to the addresses corresponding to the entry controlling the select operation of the selector 600. The 

number in the entry number register 802 via a fine 402 and selector 600 selects either a small instruction input via the 

read the instruction table 30A and store it in the instruction 40 line 50-1 or a NOP instruction input via a line 605 from a 

table memory 307. In this manner, the instruction table 30A register 604. A counter 601 of the control circuit 620 counts 

is preloaded. The other long instructions containing preload the NOP number supplied via the line 503-1 from the NOP 

small instructions for the instruction tables 30B and 30C are number queue 306-1 (FIG. 3) to generate NOP instructions 

thereafter sequentially prefetched from the main storage 100 same in number as the NOP number. The initial value of the 

and the preload small instructions are expanded by the 45 counter 601 is set to "0". A comparator 609 compares the 

instruction expanding circuits 3006 and 300c. count of the counter 601 with a constant "0" stored in a 

Thereafter, the VUW instruction series shown in FIG. register 610. If a coincidence is detected, the comparator 609 

12D is sequentially fetched by the instruction fetch unit 202 outputs a value "1". Therefore, when the small instruction is 

from the main storage 100 shown in FIG. 1. A plurality of supplied via the line 50-1, the output of the comparator 609 

grouped instructions of each VLIW instruction are trans- 50 is "1". This output is supplied via an AND gate 606 to an 

ferred to the instruction expanding circuits 300a-300i/. AND gate 608. The AND gate 606 performs a logical 

Similar to the preload instruction described previously, in product of the output of the comparator 609 and a signal on 

each instruction expanding circuit (FIG. 3), the comparator the line 505 from the group code expanding circuit. If the 

308 checks whether the transferred grouped instruction is signal on the line 505 is "0", the output of the AND gate 608 

the invalid instruction (FF instruction). If this grouped 55 becomes "0" to make the selector 601 output a NOP 

instruction is not the invalid instruction (FF instruction), the instruction. Synchronously with a master clock CLK1 of the 

group code queues 304, operand queue 305-1, NOP number processor shown in FIG. 1 and supplied via a line 611, the 

queue 306-1, operand queue 305-2 and NOP number queue AND gate 608 sends the output "1" of the comparator 609 

306-2 receive via the lines 403-1 403-5 the group code, the to the selector 600. In response to this signal, the selector 

operand group of the first small instruction in the grouped 60 600 selects the small instruction supplied via the line 50-1 

instruction, the NOP number of the first small instruction, and sends it to a corresponding fiinction unit via a line 404-1. 

the operand group of the second small instruction in the Synchronously with the output "1" of the AND gate 606 

grouped instruction, and the NOP number of the second and a half-cycle shifted clock CLK2 on a signal line 612, an 

small instruction. When each queue detects an overflow, it AND gate 614 instructs the counter 601 to receive the NOP 

outputs a signal to a signal line 418-1 to request the 65 number on the signal line 503-1. Namely, after a half-cycle 

instruction fetch unit 202 to suspend the transmission of from when the small instruction is supplied via the line 50-1, 

succeeding instructions. This signal is ORed with similar the counter 609 receives the NOP number. Therefore, the 
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output of the comparator 609 becomes "0" thereafter. As a selects the recovered small instruction, the queues 304, 
result, when the next clock CLK1 is supplied, the output of 305-1 and other queues connected to the small instruction 
the AND gate 610 becomes "0" and the selector 600 selects generating circuit 303-1 prepare output information for the 
the NOP instruction in the NOP instruction register 604 and nex t VUW instruction. Similarly, the small instruction gen- 
outputs it to a line 404. Since the inverted output of the s erating circuit 303-2 outputs a small instruction "LD FR1, 
comparator 609 and the clock CLK1 are input to the AND t» m6 tne N qp number counter 601-1 receives the NOP 
gate 616, the counter 601 counts down by "1" in response to num ber "0" supplied from the second NOP number queue 
the clock CLK1. Similarly, the NOP instruction is repeated 306-2 

untU the count of the counter 601 becomes . «0». In the above ' ^owd in FIG. 12D 

manner a small instruction and la plurality of succeeding 10 fom ^ instruction ^ wit 202 to the 

NOP instructions are generated. Each tune a comparison by . 4 ,. . 4 . An , , . 

. . * t\t\ • . ,• , , , 5. J instruction expanding unit 300, and processed in the manner 

the comparator 609 is established, the output or the com- . . £ . , . ' m . . „ 

. Av rt • A r j • i- *.ao ■% . ^ * j similar to the first cycle. This instruction (2) is different from 

parator 609 is transferred via a line 508-1 to the first operand - t . + . \ - 4 « 4 « « j a \ir\n 

t ac i ~i /^r* jt 1 j . , /-\rt the first instruction (1) in that the first and second NOP 

queue 305-1 and OP code queue 302-1, and via the OR gate , UA „ 1/ , ... . . 

^ AAi Al _ j *uwa* iL i_ .*r i ?• numbers are 4 . Therefore, after this instruction is 

509 to the group code queue 304 to thereby notify a selechon 1S ^ ^ m& ^ on ^ circuit 300fl) - 4 - fc 

of informauon output from these queues. In response to this ^ ^ NOP counters 601-1 and 601-2. 

notice, these queues prepare to output the next information. . . . . 

Similarly, in the small instruction generating circuit 303-2, At ^ tturd c y cle > 1136 instruction (3) shown in FIG. 12D 

the output of the comparator 609 is transferred via a line * transferred from the instruction fetch unit 202 to the 

508-2 to the first operand queue 305-2 and OP code queue M instruction expanding unit 300. Similar to the first cycle, 

302-2, and via the OR gate 509 to the group code queue 304. information of this instruction is stored in each queue. 

In this manner, each time the small instruction generating However, since the values of the NOP counters 601-1 and 

circuit 303-1 selects one recovered small instruction, the 6012 m not ^ sma11 instruction generating circuits 

above-described queues output the information of the next 303 1 ™ d 303 ~ 2 of ^ instruction expanding circuit 300a 

VUW instruction. As already described, when the preload 9 , select the NOP instructions supplied from the registers 604 

instruction is transferred to the instruction expanding circuit to decode ™t 203 . At this time the counts of the NOP 

300a, a notice of this, effect is informed to the small counters 601-1 and 601-2 are decremented. The small 

instruction generating circuit 303-1 via the line 505. instruction generating circuits 303-1 and 303-2 do not select 

In the following, how the instruction expanding circuit ™!™ Ctl ° n (3) *om the instruction i fetch unit 

300a shown in FIG. 3 expands the ITS instruction group in 30 ^ T^f- f thlS ^ ^t™*™ ( 3 > 15 not 

j ■ * *• u r?^ im expanded and mformation stored in each queue is main- 

the space-compressed instruction senes shown in FIG. 12D f , . . M 

tfiifif^ri it 

will be described more in particular with reference to FIGS. 

13A, 13B and 13C. FIGS. 13A, 13B and 13C show the At the fourth to sixth cycles, similar to the third cycle, the 

contents of the group code queue 304, operand queues 305-1 instructions (4)-(6) shown in FIG. 12D are sequentially 

and 305-2, OP code queues 302-1 and 302-2 and NOP 35 supplied from the instruction fetch unit 202 to the instruction 

number queues 306-1 and 306-2, the contents of the NOP expanding unit 300. At these cycles, since the values in the 

number counters 601-1 and 601-2 of the small instruction NOP counters 601-1 and 601-2 are not "0", the small 

generating circuit 303-1, and small instructions output to the instruction generating circuits 303-1 and 303-2 of the 

decode unit 203 from the small instruction generating circuit instruction expanding circuit 300a select the NOP instruc- 

at each cycle. 40 uon s supplied from the registers 604, similar to the third 

As shown in FIG. 13A, at the first cycle, an instruction (1) c y cle • At this of tne N0P v™*** 6014 and 

shown in FIG. 12D is transferred from the instruction fetch 6012 decremented. Information of the instructions (4)- 

unit 202 to the instruction expanding unit 300. As the (6) stored in each queue is mamtamed as it is. At the end of 

instruction expanding circuit 300a receives the L/S instruc- *e sixth cycle, the counts of the NOP counters 601-1 and 

turn group in this instruction, the group code "1", the first 4 5 601-2 become In tne above manner > S1X ^ s LD/ST 

operand group "FR26, Z(K+10)", the first NOP number "0", instructions of the first six VLIW instructions shown in FIG. 

the second operand group "FR2, T" and the second NOP 10 m recovered. 

number "0" are stored in the group code queue 304, the first As the seventh cycle shown in FIG. 13B, the instruction 

operand queue 305-1, the first NOP number queue 306-1, the (7) shown in FIG. 12D is transferred from the instruction 

second operand queue 305-2 and the second NOP number 50 fetch unit 202 to the instruction expanding unit 300. The 

queue 306-2, respectively. Since the information stored in LD/ST small instruction group of this instruction (7) is an 

these queues are first information, it is output immediately. instruction obtained by space-compressing a pair of NOP 

The group code expanding circuit 301 outputs a pair of OP instructions and has no operand as is expressed by empty 

codes "LD, LD" at the first entry of the instruction table 30A columns in FIG. L2D. However, in an actual case, in these 

stored in the instruction table memory to the OP code queues 55 too fields, a bit pattern of the NOP instruction without the 

302-1 and 302-2, because the group code supplied from the operand sub -field is inserted at the top location of each field, 

group code queue 304 is "1". Since the OP codes "LD" and At the seventh cycle, since the counts of the NOP counters 

"LD" supplied to the first and second OP code queues 302-1 601-1 and 601-2 are already "0", the instruction expanding 

and 302-2 are the first OP codes, these OP codes are output circuit 300a restores a non-compressed small instruction in 

immediately to the lines 501-1 and 501-2. 60 accordance with the information stored in each queue at the 

The small instruction generating circuit 303-1 selects a top location, similar to the first cycle. In this example, in 

recovered small instruction "LD FR26, Z(K+10)" consti- accordance with the information of the instruction (3), a pair 

tuted by the OP code supplied from the first OP code queue of LD/ST instructions constituting the seventh VLIW 

302-1 and the operand output from the first operand queue instruction shown in FIG. 10 is recovered. Similarly, at the 

305- 1. The NOP number counter 601-1 receives the first 65 eighth cycle, the instruction (8) shown in FIG. 12D is 
NOP number "0" supplied from the first NOP number queue transferred from the instruction fetch unit 202 to the instruc- 

306- 1. After the small instruction generating circuit 303-1 tion expanding unit 300. The instruction expanding circuit 
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300a processes the instruction (8) in a manner similar to the 20th VLTW instruction shown in FIG. 10 and corresponding 
seventh cycle. At this eighth cycle, in accordance with the to the instruction (7) (this instruction is obtained by space- 
information of the instruction (4), a pair of LD/ST ins true- compressing a pair of NOP instructions) in this example, in 
tions constituting the eighth VUW instruction shown in accordance the information stored in each queue at the lop 
FIG. 10 is recovered. Thereafter, the first and second NOP 5 location. After this recovery, the first and second NOP 
numbers "2" and "2" of the instruction (8) are stored in the numbers J'™* ^tatnictom (7) are set to the NOP 

NOP registers 601-1 and 601-2. counteis ™}} ^ d *°V 2 ' S ™ e ™ Uais ° f * e *° P 

& t . . , counters 601-1 and 601-2 are different, the small instruction 

At the ninth and tenth cycles, the instructions (9) and (10) generating circuits 303.1 303-2 operate differently 

shown in FIG. 12 are transferred from the instruction fetch thereafter 

unit 202 to the instruction expanding unit 300^ At these 10 ^ ^ ^ ^ ^ ^ ^ ^ NQp 

cycles, since the counts of the NOP counters 601-1 and m _ ± ^ ^ smaU generating circuit 303-1 

™™T ' I 1 mst ™ ciL0Ji gyrating circuits Qf ^ instmction cxpanding circuit 3 00a restores the first 

303-1 and 303-2 of the instruction expanding circuit 300* ^ instruction « STXJ FR11 X(K) » m accordance with the 

output the NOP ^instructions similar to the third cycle. The Mom ^ n stored in ^ 0P code queue 302-1 and first 
counts of the NOP counters 601-1 and 601-2 are set to CT 15 ^ ^ ^ ^ ^ 

during the tenth cycle. example, in accordance with the information of the first 

The instruction (1) transferred from the instruction fetch sma ][ instruction used for generating the instruction (8). 
unit 202 at the tenth cycle is the valid instruction (FF mis reC overy, the first NOP number stored in the first 

instruction). As the comparator 308 detects that the OP code jsjQp num ber queue 306-1 at the top location, in this 

of the instruction (10) is the same as "FF" in the FF resister example, the value "0", is set to the NOP counter 601-1. On 

309, each queue of the instruction expanding circuit 300a me S ^ G Q f me sma ij instruction generating circuit 303-2, 

does not receive the information of this instruction. This smce me count 0 f me ^OP counter 601-2 is not "0", the 

process for the invalid instruction is executed in the same NO p instruction is output and the NOP counter is decre- 

manner as the 11th to 13th cycles. mented. As a result, the instruction expanding circuit 300a 

At the 11th cycle, since the counts of the NOP counters recovers the LD/ST instruction group constituted of a pair of 

601-1 and 601-2 are "0", the instruction expanding circuit a valid small instruction and a NOP instruction. This instruc- 

300a restores non-compressed instructions in accordance tion group is a portion of the 21th instruction shown in FIG. 

with the information stored in each queue at its top location, 10. In this embodiment, a small instruction group containing 

similar to the seventh cycle. In this example, in accordance 3Q a valid small instruction and a NOP instruction can be 

with the information of the instruction (5), a pair of LD/ST recovered by using a portion of a space -compressed instruc- 

instructions constituting the 11th VLIW instruction shown in tion group. The remaining portion of the space-compressed 

FIG. 10 is recovered. After this recovery, the first and second instruction group can be used, as will be described below, for 

NOP numbers "0" and "(T of the instruction (5) are set to the recovering another small instruction group containing 

NOP counters 601-1 and 601-2. Therefore, aLso at the next 35 another valid small instruction and a NOP instruction. 
12th cycle, since the counts of the NOP counters 601-1 and ^ the 22nd cycle, since the count of the NOP counter 

601-2 are "0", the instruction expanding circuit 300a goi-l is "0", the small instruction generating circuit 303-1 . 

restores a pair of LD/ST instructions constituting the 12th 0 f me instruction expanding circuit 300a restores, similar to 

VLIW instruction shown in FIG. 10 and corresponding to the 21th cycle, the first small instruction "STU FR18, 

the instruction (6) in this example, in accordance the in for- 4Q x(K+l)" in accordance with the information stored in the 

mation stored in each queue at the top location, similar to the OP code queue 302-1 and first operand queue 305-1 at the 

11th cycle. After this recovery, the first and second NOP top locations thereof , in this example, in accordance with the 

numbers "7" and "7" of the instruction (6) are set to the NOP information of the first small instruction used for generating 

counters 601-1 and 601-2. the instruction (9). After this recovery, the first NOP number 

During the period from the 13th cycle (FIG. 13B) to the 45 stored in the first NOP number queue 306-1 at the top 

19th cycle (FIG. 13C), the counts of the NOP counters 601-1 location, in this example, the value "3", is set to the NOP 

and 601-2 are not "0**. Therefore, the instruction expanding counter 601-1. On the side of the small instruction gener- 

circuit 300a repetitively outputs the NOP instruction similar ating circuit 303-2, since the count of the NOP counter 601-2 

to the third to sixth cycles. After the NOP instruction is is not "0", the NOP instruction is output and the NOP 

output at the 19th cycle, the counts of the NOP counters 50 counter is decremented. 

601-1 and 601-2 become "0". At the 23rd, since both the NOP counters 601-1 and 601-2 

At the cycles after the 14th cycle, if a predicted branch are not "0", the small instruction generating circuits 303-1 

destination is established, this predicated instruction is and 303-2 of the instruction expanding circuit 300a output 

sequentially supplied at the prefetch stage to the instruction NOP instructions and the NOP counters 601-1 and 601-2 are 

expanding circuits 300a-300a*. If the predicated branch 55 decremented. 

destination is not established, it is determined that there is no At the 24th cycle, since the count of the NOP counter 

input. If the predicted branch destination is established and 601-2 is "0", the small instruction generating circuit 303-2 

the predicted instruction is stored in each queue of the 0 f the instruction expanding circuit 300a restores the second 

instruction expanding circuit, then the following operation is small instruction "STU FR25, X(K+2)' ' in accordance with 

performed. If a branch having a different direction from the $q the information stored in the second OP code queue 302-2 

predicted branch direction occurs, the branch unit 207 and second operand queue 305-2 at the top locations thereof, 

notifies a prediction miss to each instruction expanding in this example, in accordance with the information of the 

circuit via a signal line 415, and each instmction expanding second small instruction used for generating the instruction 

circuit invalidates the instruction in each queue. (8). After this recovery, the second NOP number stored in 

At the 20th cycle, since the counts of the NOP counters 65 the second NOP number queue 306-2 at the top location, in 

601-1 and 601-2 are "0", the instruction expanding circuit this example, the value "0", is set to the NOP counter 601-2. 

300a restores a pair of NOP instructions constituting the On the side of the small instruction generating circuit 303-1, 
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since the count of the NOP counter 601-1 is not "0", the 
NOP instruction is output and the NOP counter is decre- 
mented. As a result, the instruction expanding circuit 300a 
recovers a LD/ST instruction group constituted of a pair of 
a NOP instruction and a valid small instruction. Ibis instruc- 
tion group is a portion of the 24th instruction shown in FIG. 
10. As described above, in this embodiment, a small instruc- 
tion group containing a valid small instruction and a NOP 
instruction can be recovered by using the remaining infor- 
mation of the instruction (8) partially recovered at the 21st 
cycle. 

At the 25th cycle, the small instruction generating circuits 
303-1 and 303-2 of the instruction expanding circuit 300a 
recover a NOP instruction and a small instruction "STU 
FR32, X(K+2)", in a manner similar to the 24th cycle. Since 
the branch instruction is executed at the 25th cycle, the 
branch destination is determined so that it can be judged 
whether the branch prediction is correct or not. If correct, the 
information stored in each queue is used to expand the 
instruction by the above-described procedure. If not correct, 
the instruction stored in each queue based on the branch 
prediction is invalidated so that the instruction expansion 
can be conducted in the manner similar to the first cycle 
shown in FIG. 13A. As appreciated from the foregoing 
description, in this embodiment, a time- and space - 
compressed instruction group is expanded to restore a plu- 
rality of non -compressed small instructions and NOP 
instructions. 

It is assumed here that the OP code field 10 of the 
non-compressed VLIW instruction shown in FIG. 2 A is 12 
bits and the operand field 11 is 20 bits. Each small instruc- 
tion has therefore a length of 32bits, and the VLIW instruc- 
tion shown in FIG. 2A has a length of 32 bitsx7 fields=244 
bits. The program shown in FIG. 10 written by VLIW 
instructions has a capacity of 224 bitsx25 insrructions=700 
bytes. As the program shown in FIG, 10 is time-compressed 
by using the VLIW instructions shown in FIG. 2B, the 
number of necessary instructions is reduced to 13 instruc- 
tions as shown in FIG. 11. As the program shown in FIG. 11 
is space -compressed by using the VLIW instructions shown 
in FIG. 2C, the combinations of OP codes of instructions of 
this program are four, five and one respectively for the I7S 
instruction group, FL instruction group and FX instruction 
group, as shown in FIGS. 12A, 12B and 12C. Therefore, the 
group code 13 of the space-compressed VLIW instruction 
shown in FIG. 2C is sufficient if it has there bits. The group 
code is not necessary for the branch instruction sub-field la 
of this VLIW instruction. Assuming that the NOP number 
sub-field 12 is three bits, the length of a VLIW instruction 
is three bits (length) xthree groups+20 bits (operand length) 
xsix sub-fields (other than branch instruction) +32 bits 
(branch instruction)+three bits (NOP number sub-field for 
branch instruction)=182 bits, because the VLIW instruction 
has two LD/ST instructions, two floating point calculation 
instructions, and one branch instruction. Therefore, even if 
the bit length of one VLIW instruction increases because of 
the NOP number sub-field used for time compression, a 
shorter bit length can be obtained than the non-compressed 
VLIW instruction, by using the group code for space com- 
pression. 

The space-compressed program has 13 instructions so that 
the size of this program is about 296 bytes. The total number 
of entries of the instruction tables 30A, 30B and 30C is 10 
entries so that the size of these tables is 30 bytes. The total 
size of both the program and the tables is reduced by about 
53.4% of the original program. As compared with the fourth 
reference document (22 VLIW instructionsx32-bit small 
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instructionxseven fields~616 bytes), the total size of the 
instruction tables is as very small as about 30 bytes which is 
one 21st the fourth reference document Therefore, the 
instruction tables may be stored in a faster memory provided 

5 in the processor. 

A small instruction of the codes shown in FIG. 10 and 
compressed by the method of the fourth reference document 
becomes as long as 32 bits because it has no NOP number 
field. There are 22 different instruction combinations, and 

30 the fields which can be shared are only for the fields of NOP 
instructions. Therefore, if five bits are assigned to a code 
representative of an instruction, the total size of the program 
is five bits x25 instructions+32 bitsxseven fieldsx22 
combinations+622 bytes (including 616 bytes for instruction 

15 tables). This size reduction is only about 11% of the original 
program. In the case of the first reference document, assum- 
ing that a delay cycle field added to each VLIW instruction 
is three bits, the number of instructions is reduced to 21 
instructions in this example because there are four instruc- 

20 tions having all fields filled with NOP instructions. 
However, one VLIW instruction is 32 bitsxseven fields+ 
three bits-227 bits. The total size of the program is therefore 
about 596 bits, which means a reduction of about 15% of the 
original program. In the case of the second reference 

23 document, assuming that a delay cycle field added to each 
VLIW instruction is three bits same as the case of the first 
reference document, one VLIW instruction is 32 bitsxseven 
fields+three bitsx7=243 bits. However, the program has 13 
instructions as shown in FIG. 11 because of the effects of 

30 removing NOP instructions so that the total size of the 
program is about 398 bytes, which means a reduction of 
about 43% of the original program. 

The necessary capacity of the memory 307 used in this 
embodiment for storing an instruction table of the group 

35 code expanding circuit 301 can be reduced in the following 
manner. Assuming that the program shown in FIG. 12D is 
used, the sizes of the instruction tables 30A, 30B and 30C 
are different and that of the instruction table 30A is largest. 
The larger the size of the instruction table, as the number of 

40 OP code types of the program increases more. In the above 
embodiment, the instruction table memory of each instruc- 
tion expanding circuit is required to store the whole of one 
instruction table. The instruction expanding circuit storing a 
large instruction table is required to provide a large instruc- 

45 tion table memory. A smaller instruction table memory than 
the above embodiment may be used in the following manner. 
Namely, an instruction table stored in one instruction 
expanding circuit is divided into a plurality of partial 
instruction tables. Upon a request from a program, a nec- 

50 essary partial instruction table is read from the main memory 
and copied to the instruction table memory of the instruction 
expanding circuit. 

For example, the instruction table 30A is divided into a 
plurality of partial instruction tables, the instruction table 

55 30A storing a plurality of OP code pairs corresponding to a 
plurality of group codes which are used during the execution 
of the space-compressed program shown in FIG. 12D. The 
start partial instruction table among these partial instruction 
tables is preloaded from the main storage 100 to the group 

60 code expanding circuit 301. During the execution process of 
the space-compressed program shown in FIG. 12D, a proper 
succeeding partial instruction table is copied from the main 
storage 100. 

In order to realize the above operation, the program 
65 shown in FIG. 12D is divided into a plurality of partial 
programs during compiling, and a partial instruction table is 
generated which stores OP code pairs of the VLIW instruc- 
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tion contained in each partial program. Aphirality of partial corresponding to the group code from the instruction table 

instruction tables generated in this manner are stored in the 30A stored in the main storage 100, via a line 402. In 

main storage 100. A compressed long instruction is embed- response to the mishit signal, the instruction table memory 

ded at the top of each partial program, the long instruction 700 stores therein the OP code pair read from the main 

containing a preload instruction for preloading the partial s storage 100 by the read circuit via the line 402, at the 

instruction table to be used by each partial program in the memory location corresponding to the OP code on the line 

main storage 100. This long instruction has the same format 500. The read OP code pair is further supplied, as the valid 

as the long instruction lOOrf shown in FIG. 2D. In this case, OP code pair, via the lines 504-1 and 504-2 to the non- 

the other fields of this long instruction may be set with valid compressed OP code queues 302-1 and 302-2 (FIG. 3). The 

grouped instructions other than NOP instructions to be used 10 mishit signal is supplied via the line 709 and OR gate 711 to 

for other instruction expanding circuits. The circuit structure the small instruction generating circuits 303-1 and 303-2 

of the process is the same as that of the above-described (FIG. 3) to instruct these circuits to generate NOP instruc- 

embodiment. The structure of the instruction table of this tions. 

modification is not necessarily applied to all the instruction The above operation is performed for each of a plurality 

expanding circuits, but may be applied to only the instruc- 15 0 f succeeding group codes supplied from the group code 

tion expanding circuit having a possibility of using a rela- queue 304. [f the tag memory 701 accessed by using the 

tively large instruction table. succeeding group code supplied via the line 500 hits, the 

In order to reduce the size of the instruction table memory, instruction table memory 700 supplies the OP code pair 

an instruction table memory of a cache type may be used. stored at the memory location corresponding to this group 

In this case, a group expanding circuit 300a shown in FIG. 20 code, as the valid OP code pair, to the non-compressed OP 

6 is used in place of the group expanding circuit 300a shown code queues 302-1 and 302-2 (FIG. 3) via the lines 5-4-1 and 

in FIG. 4. Prior to fetching the instruction series shown in 504-2. 

FIG. 12D from the main storage 100, a long instruction In the example shown in FIG. 6, only when the group 

containing a preload instruction shown in FIG. 2D is used by code is not being stored in the tag memory 701, the 

the instruction expanding circuit 300a to preload the con- 25 corresponding OP code pair is fetched from the main stor- 

tents of the instruction table 30A In this case, the preload age. Therefore, the size of the instruction table memory 700 

instruction has no valid information about the entry number can be reduced more than the example of FIG. 4. 

field 18. This preload instruction is not used for preloading i n order to reduce the size of the instruction table memory, 

the instruction table 30A, but used for setting the load start the fact that a use frequency of an OP code pair by the 

address to the group code expanding circuit 300a. 30 program is different for each OP code pair may be used. This 

Referring to FIG. 6, a preload instruction register 808 example uses a fixed instruction table memory for fixedly 

stores in advance the OP code of the preload instruction like storing a plurality of frequently used OP code pairs and a 

that shown in FIG. 4. Each time a grouped instruction is variable instruction table memory for storing a plurality of 

supplied from the group code queue 304 via the line 500, the 35 OP code pairs not frequently used. 

comparator 807 compares the OP code with that stored in the In the group code expanding circuit 301a shown in FIG. 

preload instruction register 808. If a coincidence is detected 7, reference numeral 901 represents a fixed instruction table 

during this comparison, the supplied grouped instruction is memory, and reference numeral 900 represents a fixed 

considered as the preload instruction. At this time, the load instruction table memory. The fixed instruction table 

start address 17 supplied via the signal line 506 is stored in 4Q memory 901 stores a plurality of OP code pairs correspond- 

a base register 702 as the base address of the memory area ing to a plurality of predetermined group codes. No particu- 

in which the instruction table 30Ais stored. An output of the lar instruction table is not loaded from the main storage 100 

comparator 807 is also supplied to the small instruction to this memory 901. A partial instruction table necessary 

generating circuits 303-1 and 303-2 via a line 810, an OR during the execution of the program shown in FIG. 12D is 

gate 711, and the line 505 to instruct these circuits to 45 stored in the variable instruction table memory 900. In this 

generate NOP instructions, similar to the case of FIG. 4. The example, a program to be executed is divided into a plurality 

preload instruction is executed in the manner similar to the of partial programs, a corresponding partial instruction table 

above for the other instruction expanding circuits. is determined and stored in the main storage 100, and a long 

Thereafter, the instruction series shown in FIG. 12D is instruction shown in FIG. 2D and containing a preload 

sequentially fetched. Each time the group code of each 50 instruction for preloading the partial instruction table used 

instruction is supplied via the line 500, a tag memory 701 by each partial program is embedded in each partial program 

and an instruction table memory 700 are accessed by using at the top location. These operations are the same as the 

the group code. The tag memory 701 is used for storing a modification described earlier. 

plurality of group codes at corresponding memory locations However, in the example shown in FIG. 7, the partial 

thereof. Ihe instruction table memory 700 is used for storing 55 instruction table stores a plurality of OP code pairs corre- 

a plurality of OP codes corresponding to a plurality of group sponding to a plurality of group codes used by each partial 

codes registered in the tag memory 701, at corresponding program, excepting those OP codes stored in the fixed 

memory locations thereof. instruction table memory 301. Each group code has the 

If the tag memory 701 is accessed by using the group code number of bits allowing to discriminate between the fixed 

on the line 500 and this group code is mishit, the tag memory 60 instruction table and variable instruction table. For example, 

701 outputs a mishit signal to a line 709 and stores therein the group code corresponding to the OP code pair stored in 

the group code at the corresponding memory location the fixed instruction table has the most significant bit "0", 

thereof. The memory access circuit 703 has a circuit (not whereas the group code corresponding to the OP code pair 

shown) for generating a main storage address in response to stored in the variable instruction table has the most signifi- 

this mishit signal by adding the base address in the base 65 cant bit "1". 

register 702 to the group OP code on the line 500, and The preload instruction added to the top location of each 

another circuit (not shown) for reading an OP code pair partial program operates to read the partial instruction table 
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from the main storage 100, which is the same as the 
modification described previously. In this example, the read 
partial instruction table is stored in the variable instruction 
table memory 900. In accordance with the group code 
supplied via the line 500, the two instruction table memories 5 
900 and 901 are accessed at the same time, and a proper one 
of the read two OP code pairs is selected by a selector 902 
in accordance with the most significant bit of the group code. 
The selected OP code pair is supplied to the non-compressed 
OP code queues 302-1 and 302-2 (FIG. 3) as the valid OP no 
code pair. As appreciated from the foregoing description, in 
this example, the size of the partial instruction table can be 
reduced, while also reducing the number of operations of 
loading the partial instruction table from the main storage 
100 to the group code expanding circuit. 15 

A further modification is possible wherein the fixed 
instruction table is preloaded from the main storage 100 to 
the memory 901 before the program is executed. The 
memories 900 and 901 may use memories of a cache type 
described earlier. 20 

As described so far, each small instruction of a VLIW 
instruction is divided into an OP code and an operand 
sub-field, and a sub-field for writing the number of succeed- 
ing NOP instructions is provided in the operand sub-field. 
Accordingly, the number of NOP instructions can be 25 
reduced, the time density of valid instructions can be 
increased, and the use efficiency of the main storage and an 
instruction cache can be improved. Furthermore, the number 
of combinations of OP code sub-fields of small instructions 
contained in VLIW instructions can be reduced so that a 30 
possibility of using the same group code representative of 
the same pair of OP code sub-fields increases. Since a 
plurality of OP codes share the same sub-field, even if the 
instruction parallel degree becomes high, only the number of 
operand fields of the VLIW instruction is increased so that 35 
the length of the VLIW instruction can be suppressed from 
being elongated more than conventional method. 

Furthermore, since the structure of the compressed 
instruction expanding mechanism is simple, the machine 
cycle is not adversely affected and the long instruction with 40 
a high compression factor can be executed fast. 

Still further, since the time compression is performed by 
reducing NOP instructions and the valid small instruction is 
fetched before it is executed, it is possible to check in 
advance the OP codes of a branch instruction and an L/S 45 
instruction. Therefore, prediction of a branch destination 
address and data prefetch can be realized. 

What is claimed is: 

1. A VLIW instruction processor comprising: 

a long instruction supply circuit which sequentially sup- 
plies a plurality of compressed long instructions con- 
stituting a program to be executed; 

an instruction expanding unit which expands one com- 
pressed long instruction supplied from said long 55 
instruction supply circuit and generates a plurality of 
small instructions; and 

a plurality of function units which execute the plurality of 
small instructions in parallel, 

wherein the plurality of function units are divided into a 60 
plurality of function unit groups; 

wherein said instruction expanding unit has a plurality of 
instruction expanding circuits operating in parallel and 
provided in correspondence with one of the plurality of 
function unit groups; 65 

wherein each compressed long instruction has a plurality 
of instruction fields corresponding to one of the phi- 
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rality of function unit groups, each instruction field 
includes a compressed, grouped instruction represen- 
tative of a group of small instructions to be executed in 
parallel by each function unit corresponding to said 
each instruction field, the compressed, grouped instruc- 
tion being represented by a smaller number of bits than 
the number of total bits of the group of small instruc- 
tions; 

wherein each instruction expanding circuit expands a 
compressed, grouped instruction in a long instruction 
supplied from said instruction supply unit, the 
compressed, grouped instruction being contained in 
one instruction field corresponding to one function unit 
group corresponding to said each instruction expanding 
circuit, generates a group of small instructions repre- 
sented by the grouped instruction, and outputs in par- 
allel the generated group of small instructions to the 
corresponding one function unit group; 

wherein the compressed, grouped instruction contained in 
each instruction field of each long instruction includes 
a compressed group code representative of a group of 
operation codes (OP codes) necessary for each of the 
group of small instructions represented by the grouped 
instruction, the compressed group code being repre- 
sented by the smaller number of bits than the number 
of total bits of the group of OP codes, and a group of 
operand information sets representative of at least one 
operand necessary for one of the croup of small instruc- 
tions; 

wherein each instruction expanding circuit comprises: 

a group code expanding circuit which expands the 
compressed group code in a long instruction sup- 
plied from said instruction supply unit, the com- 
pressed group code being contained in an instruction 
field corresponding to said each instruction expand- 
ing circuit, generates-a group of OP codes necessary 
for the group of small instructions represented by the 
group code, and outputs in parallel the croup of 
generated OP codes, and 

a small instruction generating unit which generates a 
croup of small instructions by combining each of the 
group of output OP codes with one of the croup of 
operand information sets, and outputs in parallel the 
group of generated small instructions to the corre- 
sponding one function unit group corresponding to 
said each instruction expanding circuit; 

wherein the compressed group code in each instruction 
field of each long instruction is set as a pointer to an 
entry of one instruction table among a plurality of 
predetermined instruction tables; 

wherein the plurality of instruction tables each register 
a plurality group of OP codes; 

wherein each instruction table is set in correspondence 
with each instruction field of a plurality of long 
instructions, said each instruction field correspond- 
ing to each of the plurality of function unit groups; 

wherein combinations of the plurality group of OP 
codes registered in each instruction table are com- 
binations of a plurality group of unique OP codes, 
among the combinations of a plurality group of OP 
codes represented by a plurality of compressed group 
codes in a plurality of instruction fields correspond- 
ing to said each instruction table and contained in the 
plurality of long instructions; and 

wherein one entry of each instruction table registers 
one combination of a group of OP codes represented 
by the group code of each instruction field contained 
in each Long instruction. 
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2. A VLIW instruction processor according to claim 1, 
wherein: 

the group code expanding circuit in each instruction 
expanding circuit includes a memory for storing com- 
binations of the plurality group of OP codes represented 
by the plurality of group codes, and in response to one 
group code supplied from said long instruction supply 
circuit, outputting one combination of a plurality of OP 
codes represented by the supplied group code, among 
the plurality group of OP codes; 

the combinations of the plurality group of OP codes stored 
in said memory are at least a portion of the combina- 
tions of the plurality group of OP codes registered in 
one of the plurality of instruction tables; and 

the one of the plurality of instruction tables is set in 
correspondence with the same instruction fields of the 
plurality of long instructions corresponding to one 
function unit group corresponding to the corresponding 
instruction expanding circuit. 

3. A VLIW instruction processor according to claim 2, 
wherein: 

the plurality of instruction tables are stored in a main 
storage connected to the VLIW instruction processor; 
and 

the VLIW instruction processor further comprises a cir- 
cuit for storing, in response to a particular load instruc- 
tion contained in the program, the combinations of the 
plurality group of OP codes belonging to one instruc- 
tion table stored in said main storage at a storage 
location designated by the particular load instruction, in 
said memory contained in one of the plurality of 
instruction expanding circuits. 

4. A VLIW instruction processor according to claim 2, 
wherein said memory in the group code expanding circuit 
contained in at least one of the plurality of instruction 
expanding circuits has a capacity sufficient for storing the 
combinations of the plurality group of OP codes registered 
in one instruction table corresponding to ones of an instruc- 
tion field in the plurality of long instructions corresponding 
to one function unit group corresponding to said at least one 
of the plurality of instruction expanding circuits. 

5. A VLIW instruction processor according to claim 2, 
wherein: 

at least one of the plurality of instruction tables includes 
a plurality of partial instruction tables set in correspon- 
dence with a plurality of partial programs of the pro- 
gram; 

the plurality of partial instruction programs are stored in 
said main storage; and 

the VLIW instruction processor further comprises a cir- 
cuit for storing, prior to executing each partial program 
of the program, the combinations of the plurality group 
of OP codes registered in one of the plurality of partial 
instruction tables stored in said main storage, into said 55 
memory contained in at least one of the plurality of 
instruction expanding circuits. 

6. A VLIW instruction processor according to claim 2, 
wherein: 

said memory of said group code expanding circuit con- 
tained in at least one of the plurality of instruction 
expanding circuits includes: 

a first memory for storing combinations of a plurality 
group of OP codes corresponding to a plurality of 
predetermined group codes; and 

a second memory for storing combinations of a plural- 
ity of other OP codes corresponding to a plurality of 
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group codes other than the plurality of predetermined 
group codes, and 
the VLIW instruction processor comprises: 

a first circuit for storing the combinations of the 

plurality of OP codes stored in said main storage, 

in said second memory; and 
a second circuit for instructing said first circuit to 

store the combinations of a plurality group of 

different OP codes during the execution of the 

program. 

7. A VLIW instruction processor according to claim 3, 
wherein said group code expanding circuit contained at least 
one of the plurality of instruction expanding circuits 
includes: 

a circuit for checking whether a combination of a plurality 
of OP codes corresponding to each group code supplied 
from said instruction supply circuit to said at least one 
instruction expanding circuit is stored in said memory 
in the group expanding circuit; and 

a circuit for storing at least the combination of the 
plurality of OP codes stored in said main storage, in 
said memory, if it is judged that the combination of the 
plurality of OP codes is not stored in said memory. 

8. A VLIW instruction processor comprising: 

a long instruction supply circuit which sequentially sup- 
plies a plurality of long instructions constituting a 
program to be executed; 

a plurality of function units which executes a plurality of 
small instructions in parallel; and 

an instruction supply unit which supplies a plurality of 
small instructions designated by a long instruction 
supplied from said long instruction supply circuit, to 
the plurality of function units, 

wherein each long instruction has a plurality of instruction 
fields each corresponding to one of the plurality of 
function units, each instruction field including at least 
one small instruction and bit information for designat- 
ing a NOP number associated with said one small 
instruction; 

wherein said NOP number represents the number of NOP 
instructions to be executed by one function unit corre- 
sponding to each instruction field which one function 
unit executed said one small instruction; 

wherein said instruction supply unit includes a plurality of 
small instruction generating circuits each provided in 
correspondence with each of the plurality of function 
units; and 

wherein each small instruction generating circuit com- 
prises: 

a selector circuit which supplies one small instruction 
associated with the bit information contained in a 
corresponding instruction field of one long instruc- 
tion supplied from said long instruction supply cir- 
cuit to one function unit corresponding to said each 
small instruction generating circuit, and after said 
one small instruction is supplied, sequentially sup- 
plies NOP instructions same in number as that des- 
ignated by the bit information contained in said 
corresponding instruction field to said corresponding 
function unit, and 

a control circuit which inhibits said selector circuit to 
supply a small instruction contained in a succeeding 
long instruction supplied from said long instruction 
supply circuit to said corresponding function unit, 
until the NOP instructions same in number as that 
designated by the bit information are supplied com- 
pletely. 
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9. A VLIW instruction processor comprising: 

a long instruction supply circuit which sequentially sup- 
plies a plurality of compressed long instructions con- 
stituting a program to be executed; 

an instruction expanding unit which expands one com- 5 
pressed long instruction supplied from said long 
instruction supply circuit and generates a plurality of 
small instructions; and 

a plurality of function units which execute the plurality of 
small instructions in parallel, 10 

wherein the plurality of function units are divided into a 
plurality of function unit groups; 

wherein said instruction expanding unit has a plurality of 
instruction expanding circuits operating in parallel and 5 
provided in correspondence with one of the plurality of 
function unit groups; 

wherein each compressed lone instruction has a plurality 
of instruction fields corresponding to one of the plu- 
rality of function unit groups, each instruction field ^ 
includes a compressed, grouped instruction represen- 
tative of a group of small instructions to be executed in 
parallel by each function unit corresponding to said 
each instruction field, the compressed, grouped instruc- 
tion being represented by a smaller number of bits than 25 
the number of total bits of the group of small instruc- 
tions; 

wherein each instruction expanding circuit expands a 
compressed, grouped instruction in a long instruction 
supplied from said instruction supply unit, the 30 
compressed, grouped instruction being contained in 
one instruction field corresponding to one function unit 
group corresponding to said each instruction expanding 
circuit, generates a group of small instructions repre- 
sented by the groused instruction, and outputs in par- 35 
allel the generated group of small instructions to the 
corresponding one function unit group; 

wherein the compressed, grouped instruction contained in 
each instruction field of each long instruction includes 
a compressed group code representative of a group of 40 
operation codes (OP codes) necessary for each of the 
group of small instructions represented by the grouped 
instruction, the compressed group code being repre- 
sented by the smaller number of bits than the number 
of total bits of the group of OP codes, and a group of 45 
operand information sets representative of at least one 
operand necessary for one of the group of small instruc- 
tions; 

wherein each instruction expanding circuit comprises: 
a group code expanding circuit which expands the 50 
compressed group code in a long instruction sup- 
plied from said instruction supply unit, the com- 
pressed group code being contained in an instruction 
field corresponding to said each instruction expand- 
ing circuit, generates a croup of OP codes necessary 55 
for the group of small instructions represented by the 
group code, and outputs in parallel the group of 
generated OP codes, and 
a small instruction generating unit which generates a 
group of small instructions by combining each of the 60 
group of output OP codes with one of the group of 
operand information sets, and outputs in parallel the 
group of generated small instructions to the corre- 
sponding one function unit group corresponding to 
said each instruction expanding circuit; 65 
wherein the compressed, grouped instruction contained 
in each instruction field of each long instruction 
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further includes a plurality of NOP numbers each 
corresponding to each of a plurality of small instruc- 
tions represented by the compressed group code in 
the grouped instruction; 

wherein each NOP number represents the number of 
NOP instructions to be executed by one function unit 
which executed one small instruction corresponding 
to each NOP number; 

wherein said small instruction supply unit in each 
instruction expanding circuit includes a plurality of 
small instruction generating circuits provided in cor- 
respondence with respective function units of the 
function unit group corresponding to said each 
instruction expanding circuit; and 

wherein each small instruction generating circuit 
includes: 

a selector circuit which generates one small instruc- 
tion by combining one of the group of OP codes 
generated by said group code expanding circuit of 
each instruction expanding circuit, with respect to 
the compressed group code in one grouped 
instruction in one long instruction supplied from 
said long instruction supply circuit, with one of 
the group of operand information sets in said one 
grouped instruction, supplies said generated one 
small instruction to one function unit correspond- 
ing to said each small instruction generating 
circuit, and after said one small instruction is 
supplied, sequentially supplies NOP instructions 
same in number as that designated by the corre- 
sponding NOP number among the group of NOP 
numbers to the function unit corresponding to said 
each small instruction generating circuit, and 

a control circuit which inhibits each small instruction 
generating circuit to supply a small instruction to 
the corresponding function unit, the small instruc- 
tion being supplied from said each instruction 
expanding circuit after the group of OP codes and 
containing one of a group of succeeding OP codes 
generated from a long instruction succeeding said 
one long instruction, until the NOP instructions 
same in number as that represented by said cor- 
responding NOP number are supplied completely. 
10. A VLIW instruction processor according to claim 9, 
wherein: 

each instruction expanding circuit further includes a plu- 
rality of OP code queues provided in correspondence 
with a plurality of function units of each function unit 
group corresponding to said each instruction expanding 
circuit; 

each OP code queue is connected to said group code 
expanding circuit and one of the plurality of small 
instruction generating circuits, stores a corresponding 
one of a plurality of OP codes each time the plurality 
of OP codes are output from the group code expanding 
circuit of each instruction expanding circuit, with 
respect to the group code in one grouped instruction of 
one compressed long instruction supplied from said 
long instruction supply circuit to said each instruction 
expanding circuit, sequentially stores a plurality of OP 
codes supplied from the group code expanding circuit, 
and sequentially outputs in the storage order the plu- 
rality of stored OP codes to said one small instruction 
generating circuit connected to said each OP code 
queue; and 

said control circuit of each small instruction generating 
circuit has a circuit for instructing one OP code queue 
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among the plurality of OP code queues connected to 
said each small instruction generating circuit to output 
the next OP code synchronously with a completion of 
supplying NOP instructions same in number as that 
designated by said corresponding NOP number. s 

11. A VLJW instruction processor according to claim 10, 
wherein: 

each instruction expanding circuit further comprises: 

a group code queue connected to said long instruction 
supply circuit and one of the plurality of group code 10 
expanding circuits; 

a group of operand queues each connected to said long 
instruction supply circuit and one of the plurality of 
small instruction generating circuits in said each 
instruction expanding circuit; and 15 

a group of NOP numbers each connected to one of the 
plurality of small instruction generating circuits in 
said each instruction expanding circuit; 

said group code queue stores a plurality of compressed 
group codes each contained in each of a plurality of 20 
instruction fields corresponding to the function unit 
group corresponding to each instruction expanding 
circuit, the plurality of compressed group codes 
contained in a plurality of long instructions sequen- 
tially supplied from said long instruction supply is 
circuit, and outputting in the storage order the plu- 
rality of stored, compressed group codes to said one 
group code expanding circuit; 

each of the group of operand queues stores a group of 
operand information sets to be used by a plurality of 30 
small instructions to be executed by said one func- 
tion unit group corresponding to said each instruc- 
tion expanding circuit, the group of operand infor- 
mation sets being contained in a plurality group of 
operand information sets contained in the plurality of 35 
instruction fields of the plurality of long instructions, 
and sequentially supplying the plurality of stored 
operand information sets to the small instruction 
generating circuit connected to said each operand 
queue; 40 

each of the group of NOP number queues stores a group 
of NOP numbers to be used by a plurality of small 
instructions to be executed by said one function unit 
group corresponding to said each instruction expand- 
ing circuit, the group of NOP numbers being con- 45 
tained in a plurality group of NOP numbers con- 
tained in the plurality of instruction fields of the 
plurality of long instructions, and sequentially sup- 
plying the plurality of stored NOP numbers to the 
small instruction generating circuit connected to said 50 
each NOP number queue; and 

said control circuit in each of the plurality of small 
instruction generating circuits in said instruction 
expanding circuit includes a circuit for instructing 
one operand queue among the group of operand 55 
queues connected to said each small instruction 
generating circuit and one NOP number queues 
among the group of NOP number queues connected 
to said each small instruction generating circuit to 
output the next operand and the next NOP number 60 
synchronously with a completion of supplying one 
or a plurality of NOP instructions by said selector 
circuit of said each small instruction generating 
circuit 

12. A VLIW instruction processor according to claim 11, 65 
wherein each instruction expanding circuit further includes 

a circuit connected to a plurality of control circuits in the 



plurality of small instruction generating circuits contained in 
said each instruction expanding circuit, said circuit request- 
ing said group code expariding circuit to supply the next 
group code to said group code expanding circuit in response 
to a request issued from one of the plurality of control 
circuits for supplying the next operand and the next NOP 
number. 

13. A VLIW instruction processor according to claim 11, 
wherein each instruction expanding circuit comprises: 

a register which stores a specific group code; and 
a comparator connected to said group code queue, to the 
plurality of operand queues, and to the plurality of NOP 
number queues, which compares the specific group 
code stored in said register with a group code contained 
in the instruction field of one long instruction supplied 
from said long instruction supply circuit, 
wherein if a comparison result by said comparator indi- 
cates a coincidence of both the codes, said group code 
queue, the plurality of operand queues, and the plurality 
of NOP number queues do not receive the group code. 

14. A VLIW instruction processor according to claim 9, 
wherein each instruction expanding circuit comprises: 

a register which stores a specific group code; and 
a comparator which compares the specific group code 
stored in said register with a group code contained in 
the instruction field of one long instruction supplied 
from said long instruction supply circuit, 
wherein if a comparison result by said comparator indi- 
cates a coincidence of both the codes, reception of the 
group code is inhibited. 

15. A long instruction expanding method for a VLIW 
instruction processor which executes a program composed 
of a plurality of long instructions obtained by compressing 
a plurality of small instructions to be executed in parallel by 
a plurality of function units, each long instruction having a 
plurality of grouped instructions each obtained by compress- 
ing a plurality of small instructions to be executed in 
parallel, for each of a plurality of function unit groups 
obtained by dividing the plurality of function units, the 
method comprising the steps of; 

storing in advance a plurality of combinations of a plu- 
rality of small instructions to be executed by each of the 
plurality of function groups, in at least one instruction 
table memory; 

sequentially storing long instructions composing the pro- 
gram in an instruction expanding unit; 

deriving each grouped instruction of each long instruc- 
tion; 

by using the contents of each derived grouped instruction 
as a pointer value, reading a combination of small 
instructions corresponding to the grouped instruction 
from the instruction table memory; 

generating a plurality of small instructions for each 
derived grouped instruction to be executed in parallel, 
in accordance with the read combination of small 
instructions and the derived grouped instruction; and 

outputting the plurality of generated small instructions for 
each grouped instruction to a plurality of function units 
of the corresponding function unit group, 

wherein the number of bits of each grouped instruction is 
smaller than the number of total bits of the small 
instructions which were compressed to generate the 
grouped instruction. 

16. A method according to claim 15, further comprising 
the step of reading a plurality of combinations of a plurality 
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of small instructions from a main storage connected to the 
VLIW processor and loading the read combinations in the 
instruction table memory, in accordance with the contents of 
the derived grouped instruction, if the derived grouped 
instruction is a preset load instruction. 

17. A long instruction expanding method for a VLIW 
instruction processor which executes a program composed 
of a plurality of long instructions obtained by compressing 
a plurality of small instructions to be executed in parallel by 
a plurality of function units, each long instruction having a 
plurality of grouped instructions each obtained by compress- 
ing a plurality of small instructions to be executed in 
parallel, for each of a plurality of function unit groups 
obtained by dividing the plurality of function units, each 
grouped instruction having bit information representative of 
the number of NOP instructions to be executed continuously 
after a corresponding small instruction for each function unit 
is executed, the method comprising the steps of: 

storing in advance a plurality of combinations of a plu- 
rality of small instructions to be executed by each of the 
plurality of function groups, in at least one instruction 
table memory; 
sequentially storing long instructions composing the pro- 
gram in an instruction expanding unit; 
deriving each grouped instruction of each long instruc- 
tion; 

by using the contents of each derived grouped instruction 
as a pointer value, reading a combination of small 
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instructions corresponding to the grouped instruction 
from the instruction table memory; 
generating a plurality of small instructions for each 
derived grouped instruction to be executed in parallel, 
in accordance with the read combination of small 
instructions and the derived grouped instruction; 
outputting the plurality of generated small instructions for 
each grouped instruction to a plurality of function units 
of the corresponding function unit group; and 
outputting NOP instructions same in number as that 
designated by the corresponding bit information, for 
each grouped instruction, to a plurality of function units 
of the corresponding function unit group, 
wherein the number of bits of each grouped instruction is 
smaller than a sum of the number of total bits of the 
small instructions which were compressed to generate 
the grouped instruction, and the number of bits of the 
NOP instruction. 
18. A method according to claim 17, further comprising 
the step of reading a plurality of combinations of a plurality 
of small instructions from a main storage connected to the 
VLIW processor and loading the read combinations in the 
instruction table memory, in accordance with the contents of 
the derived grouped instruction, if the derived grouped 
instruction is a preset load instruction. 
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